Inhomogeneous substructures hidden in random networks 
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We study the structure of the load-based spanning tree (LST) that carries the maximum weight 
of the Erdos-Renyi (ER) random network. The weight of an edge is given by the edge-betweenness 
centrality, the effective number of shortest paths through the edge. We find that the LSTs present 
very inhomogeneous structures in contrast to the homogeneous structures of the original networks. 
Moreover, it turns out that the structure of the LST changes dramatically as the edge density of an 
ER network increases, from scale-free with a cutoff, scale-free, to a star-like topology. These would 
not be possible if the weights are randomly distributed, which implies that topology of the shortest 
path is correlated in spite of the homogeneous topology of the random network. 

PACS numbers: 89.75.Hc, 89.75.Da, 89.75. Fb, 64.60.-i 



Complex network theories have attracted much atten- 
tion because of the usefulness to analyze diverse com- 
plex systems in the real world P, 0. The most repre- 
sentative measure characterizing a network is the degree 
distribution, P(k), which indicates probability for a ver- 
tex to be directly connected to fc neighboring vertices 
with edges. Specifically, while the power-law distribu- 
tion, P(k) ~ k _7 , is widely found in the most of real- 
world networks including technological, biological, and 
social networks, such as the Internet [3j , the World Wide 
Web Q, the metabolic networks the protein inter- 
action networks |(|, and the coauthorship networks 0, 
there also exist homogeneous networks, such as the US 
highway network and the US power-grid network, which 
are explained by the bell-shaped and exponential degree 
distributions. Recently it has been claimed that an ap- 
parent scale-free network can be originated from a sam- 
pling result of a underlying homogeneous network 

While the degree distribution gives valuable knowledge 
of local structures of networks around us, it is also neces- 
sary to know global structures of networks to understand 
dynamics on the networks properly. The information 
transport between two vertices occurs along an optimal 
path connecting them, defined as the path minimizing 
the total cost j^, fToL flT| . which is usually determined 
by using the global knowledge of the network. For in- 
stance, full information of connections in the network is 
required to determine a shortest path defined as a path 
consisting of a minimum number of edges, which would 
be an optimal path if the costs of all edges are identi- 
cal. The scale-free network has been revealed to have 
very inhomogeneous shortest path topology so that one 
can find extremely important vertices or edges that huge 
number of shortest paths are passing through, which has 
been supported by the power-law distribution of the be- 
tweenness centrality (BC) IT3l and the existence of the 
transport skeleton structure [l3j ] that makes it possible to 
understand the origin of the difference between the BC 
exponent classes of real-world networks lip. ITpt a nd the 
universal properties of the fractal scaling~[l4l Il5|. How- 
ever, in the non-scale-free networks, even though it has 
been known that there are no such vertices or edges used 



heavily in the shortest paths, the topology of the shortest 
paths has not been intensively studied so far. 

Our main interest is to find out how the topology of 
shortest paths is correlated with the network topology in 
the Erdos-Renyi (ER) random network model , where 
two arbitrary vertices in the network are randomly con- 
nected to each other by an edge with a given probability 
p, which gives the Poisson degree distribution. In order 
to systematically study the shortest path topology, it is 
necessary to treat the network as a weighted network in 
which the contribution of the shortest paths on each edge 
is assigned as the weight of the edge. We use the edge- 
betweenness centrality (edge-BC) [Pn . ll8l.IT9l ] to represent 
the contribution of the shortest paths on each edge of the 
network, which is a convenient quantity counting the ef- 
fective number of shortest paths through the edge and 
thereby gives the average traffic though the edges. The 
edge-BC of an edge between vertices i and j is the 
total contribution of the edge on the shortest paths be- 
tween all possible pairs of vertices, which is defined as 
follows: 



c{m,n;i,j) 
' c(m,n) 
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where c(m,n;i,j) denotes the number of shortest paths 
from a vertex m to n through the edge e^, and c(m,n) 
is the total number of shortest paths from m to n. 

In this weighted networks, one useful way to study the 
spatial correlation of the weight is to investigate the load- 
based spanning tree (LST) |l3| . which consists of a set of 
selected edges to maximize its total weights, which cor- 
responds to the skeleton of the network • From the 
degree distribution of the LST, we can check whether the 
spatial distribution of the weights is correlated with the 
original network topology. If the weights are randomly 
distributed on the edges of the network, the degree distri- 
bution of the original network would be preserved in its 
LST [2(|. On the other hand, if there exists significant 
topological correlation in the distribution of the weights 
on the edges, the degree distribution is expected to show 
a large deviation from the Poisson distribution, the de- 
gree distribution of the ER network. In this paper, we 
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FIG. 1: (a) An illustration of topologies of the LST (solid 
line) and the original ER network (dotted line) with 100 
vertices and p — 0.1. (b) The degree distributions of the 
LSTs for various constructions of the ER networks with 
N = 4000 vertices and connection probability of edges p = 
0.004,0.014,0.024. The degree distributions are guided by the 
curve-fits (dotted lines) to P(k) ~ fe" 7 with 7 = 1.47, 1.97, 
4.20 for p = 0.004, 0.014, 0.024, respectively, (c) The degree 
distribution of the LST when the weights are randomly redis- 
tributed on the edges. The Poisson degree distribution with 
average degree 2 of trees (dotted line) is given for compari- 
son. All data points are obtained from the averages over 256 
network ensembles. 



investigate the structure properties of the LST of the 
ER model in this respect. We find that the LSTs show 
very inhomogeneous structures in contrast to the homo- 
geneous structures of the original networks. It is found 
that the degree distribution of the LST shows rich char- 
acteristics depending on the connection probability and 
the size of the ER network, which turns out to be very 
different from the Poisson distribution. 

Figure ^a) shows an illustration of the inhomoge- 
neous LST structure obtained from the ER network with 
N = 100 vertices and p = 0.1, in which the hub ver- 
tices having a significant number of degree are found. 
The detailed characteristics of degrees can be found in 
Fig. ^b) which displays the degree distributions of the 
LSTs obtained from the ER networks generated with var- 
ious connection probabilities. It is the most interesting 
feature that the right-skewed degree distributions are ob- 
served in the LSTs for the wide range of connection prob- 
ability p of the ER network whose degree distributions 
follow the narrow Poisson distribution. For the examined 
LSTs, it is found that a power-law with a cutoff fits well 
to the degree distribution of the LST, where the expo- 
nent and the cutoff degree depend on p. The emergence 
of these inhomogeneous degree distributions in the LSTs 
indicates that there exist non-negligible correlations be- 



tween the weights of neighboring edges sharing a common 
vertex at their ends, since a vertex shared by the edges 
having higher weights becomes a hub in the LST. If we re- 
move the correlation by the random redistribution plj of 
the weights on the edges, the degree distributions of the 
LSTs become the Poisson distribution as expected [see 
Fig. njc)] . These imply that the shortest paths, which 
give the weights of its constituting edges, are not ran- 
domly distributed on the ER network but strongly cor- 
related enough to generate an inhomogeneity in spite of 
the homogeneous topology of the ER network. 

The topological correlation of the shortest path that 
gives rise to the inhomogeneous LST structure can be 
specified by the correlation between the degree of the ver- 
tex and the weights of the edges connected to the vertex. 
In order to find out more about the spatial correlation 
in the distribution of the weights, we measure the av- 
erage weight rank Rk, an averaged value of weight-rank 
r over the edges attached to a vertex having degree k. 
The rank of the edge between i and j is graded for its 
weight Wij, i.e., the largest weight gives r = 1, the second 
largest weight gives r = 2, and so on. Mathematically 
Rk is defined as follows: 

«* = (t^t £ ££ r «°A ( 2 ) 

where and \Vk\ are the set of vertices having degree k 
and the number of those vertices, respectively, is the 
adjacency matrix element; a.y — 1 if i and j is connected 
and ciij = otherwise, and (...) denotes an average over 
network ensembles. Consequently, the small (large) value 
of R indicates that a vertex has the edge with high (low) 
ranks. The reason why we attach great importance to 
Rk is that it gives an insight into how the structure of 
the network changes in the LST because the edges of 
the network are picked in the rank order for the LST. 
While the average value of the weights can give similar 
information, since the ER network has a rather homoge- 
neous distribution of weight values, the average weight 
rank shows more clear changes depending on the spatial 
distribution of the weights as compared with the average 
value of the weights. In counting ranks, we do not admit 
a tie in ranks for the edges with the same weights but 
assign ranks according to random priority if the values of 
weights are the same. However, we note that averaging 
over the network ensembles removes the dependence on 
the counting method of the tie in ranks. 

In order to figure out how the weights are distributed 
on the network, we compare the average weight rank Rk 
with that for the uncorrelated one in which the weights 
are randomly redistributed on the edges to remove the 
correlation between the edge and the weight on it. For 
instance, if Rk is smaller than the value for uncorrelated 
weights, the vertex with degree k can be regarded to have 
an edge with a higher weight as compared with the uncor- 
related situation. This comparison gives an insight into 
the topological correlation of the weights on the network. 
From Fig. |2I a), we find that the extreme vertices, which 
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FIG. 2: (a) The comparison of the average weight rank 7?^ 
of the weighted ER network with that of the uncorrelated 
network (dotted line) generated by the random redistribu- 
tion of weights on the edges, (b) The degree correlation be- 
tween the LST and the original ER network. ER networks 
with N — 4000 vertices and connection probability of edges 
p = 0.004, 0.014, 0.024 are examined, k and a\ denote the av- 
erage and the variance of degrees in the original ER network, 
respectively. The data points are obtained from the averages 
over 256 network ensembles. 



belong to the tail of either the smaller or larger degree 
part in the Poisson degree distribution, have significantly 
higher average weight ranks (small value of Rk) than the 
average rank for the uncorrelated spatial distribution of 
the weights, given by ~ L/2, where L is the total number 
of edges of the original network. This indicates that these 
extreme vertices are more likely to preserve their degree 
in the LSTs. More directly, we also measure the degree 
correlation between the LST and its original network. In 
Fig-G^b), for the extreme vertices, it is confirmed that the 
degrees of the LST show strong correlations with the de- 
grees of the original network. Consequently, the extreme 
vertices become hubs that constitute the heavy tail of the 
inhomogeneous degree distribution of the LST. 

The structural homogeneity of the ER network varies 
with the edge density represented by using the connection 
probability p. As well as the structural properties, the 
characteristics of the shortest path topology also depend 
on the edge density of the ER network. Specifically, the 
change of the connection probability p in the ER network 
gives rise to the change of the degree distribution of its 
LST. At small values of p's, as seen in Fig.^b), the de- 
gree distribution of the LST can be characterized by the 
power-law exponent 7 and the average maximum degree 
kmax- Thus, we can monitor the structural change of the 
LST by looking at the dependence of 7 and k max on the 
connection probability p and the network size N . 

As p increases, 7 monotonically increases, but inter- 
estingly, k max shows non-trivial behavior of decreasing 
even though k max has a chance to increase further as p 
increases because a higher p generates a larger average 
degree in the original ER network. To find a univer- 
sal description of this non-trivial behavior of the degree 
distributions of the LSTs, we perform a size-scaling on 
data obtained from the ER networks of various sizes and 
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FIG. 3: The p dependence of (a) the average maximum de- 
gree k m ax and (b) the power-law exponent 7 in the degree 
distribution of the LST. In (c), the comparison of k m ax with 
the natural cutoff jV 1//( - 7_1 - ) is given. The data points are 
obtained from the averages over 256 ensembles. 



connection probabilities and find out a common scaling 
feature; k max monotonically increases to the maximum 
value of ~ N 1 / 2 until p reaches at ~ N^ 1 / 2 , then it 
decreases as p increases further while 7 monotonically 
increases [see Fig.|3]. 

This p dependence of 7 and k max strongly implies that 
there exists a critical p value at which a clear scale-free 
degree distribution emerges by satisfying the balance be- 
tween 7 and k max . More precisely, because the natural 
cutoff k cuto ff scales with ~ TV 1 /' 7-1 ) in the finite sized 
network of N vertices with power-law degree distribu- 
tion P(k) ~ k~ 1 [22t |2^|. we can test the scale-freeness 
of the degree distribution of the LST depending on p 
by comparing k max with the natural cutoff. From the 
plot of kmax/N 1 ^ 1 ^ 1 ^ as a function of pN 1 / 2 shown in 
Fig. Etc), we find that k max of the LST becomes com- 
parable with the natural cutoff TV 1 ^ 7-1 ' when p reaches 
at N^ 1 / 2 so that the cutoff diminishes in the degree dis- 
tribution, which leads to the emergence of the scale-free 
spanning tree. 

The exponent 7 of the power-law degree distribution 
increases as p increases in the small p region as shown 
in Fig. |3| but the Poisson degree distribution of the orig- 
inal ER network is not recovered even if p gets much 
larger. Interestingly, the average maximum degree k„ 
increases when p becomes far larger than ~ N^ 1 ^ 2 |24 
For intermediate values of p's, the structures of LSTs 
show very diverse degree distribution depending on the 
ensemble of the ER network generation since there can 
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FIG. 4: (a) An example of the very dense network in which 
only one edges (dotted line) are eliminated from the fully 
connected network. The edge-BCs are given on the edges, 
which indicates that the edges of smaller degree vertices have 
more edge-BCs than the other edges, (b) Typical topology of 
the LST for very dense ER network. The structure is obtained 
from the ER network with N = 100 vertices and connection 
probability p = 0.95. 

exist lots of edges that have same value of the edgc-BC 
(weights), which gives large degeneracy in constructing 
LSTs. However, we find that a star-like structure finally 
emerges in the LST [see Fig. 0] as the ER network gets 
very denser to be a nearly fully-connected network. 

In the limiting cases, this can be understood easily. 
Let us consider a nearly fully connected network in which 
only a single pair of vertices are not connected while all 
the other vertex pairs are directly connected by edges 
plj . In this case, the shortest path connecting these two 
vertices must pass through one of their common neigh- 
bor vertices, and consequently this shortest path gives 
additional contributions to the edge-BCs of the edges 
connected to these two vertices having smaller degrees 
[see Fig. a)]. Therefore, in the edge selection for con- 
structing LSTs, the edges connected to the smaller degree 
vertices are chosen with higher priorities, which leads the 



emergence of the star-like LSTs. 

Finally, we note that the emergence of the scale-free 
spanning tree in the ER network also has been reported 
in the two recent works. Clauset et al. Q reported that 
the spanning tree constructed by using traceroutes from 
a single source has power-law degree distribution with 
cutoff. Kalisky et al. |26| presented that the minimum 
spanning tree of the percolation cluster in the ER net- 
work shows the power-law degree distribution. Together 
with these works, our findings indicate that the consid- 
eration of the transport pathways or the weights of the 
edges can drastically changes the original homogeneous 
topology of the ER network. 

In conclusion, we have investigated the structural 
properties of the LST of the weighted ER network in 
which the weight of an edge is given by the edge-BC. 
We have found that the degree distribution of the LST 
shows very inhomogeneous distribution, which can be de- 
scribed as power-law with a cutoff, scale-free, or star-like 
depending on the edge density of the ER network. The 
emergence of these inhomogeneous degree distributions 
in the LST implies that the shortest paths are not homo- 
geneously distributed on the ER network but spatially 
correlated in spite of the homogeneous topology of the 
ER network. 
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